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1 Introduction 



The minority game, introduced by Challet and Zhang [[!]], is a simple model 
to mimic the internal dynamics of a market where agents can only buy or sell 
one commodity. At each time step, N agents choose independently between 
two different sides, or 1, that is, they have to choose recurrently between 
two different opportunities, like buying or selling. The side chosen by the 
minority of agents is identified as the winning side and all agents that have 
chosen this side are rewarded with a point. The points of the other agents, 
the losers, do not change. Agents strive to maximize their number of points. 
Such minority competitions are reminiscent of systems where individuals have 
to compete for scarce resources. Ref. |2| motivates this model in more detail. 
Various extensions of the model have also been introduced |3|]. 

The record of which side was the winning side for the last m time steps 
constitutes the history of the system. Agents analyze the history for the last 
m time steps to make a decision for the next time step. Hence, they have 
to be provided with a vector of length 2 m whose components are O's or l's. 
Each component is a decision corresponding to a particular history. Any such 
binary vector of size 2 m is called a strategy. See |2| and references herein for 
a more detailed discussion about the strategies used by the agents. 

Agents use a strategy at each time step. Each agent has at his disposal 
a fixed set of s strategies chosen at random, multiple choices of the same 
strategy being allowed. At each time step, an agent uses one of his strategies 
to make a decision. The chosen strategy is called the active strategy while, 
conversely, the s — 1 other strategies are called passive strategies. 
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The strategies, like the agents, are rewarded points according to their 
merit. A strategy earns a point each time it forecasts the winning side, 
irrespective of whether it is used or not. The points given to a strategy are 
called virtual points, they represent the points an agent could have earned 
had he always played with this strategy. Consequently, an agent always plays 
with his strategy with the highest number of virtual points, assuming that a 
strategy that has already won will win again. This assumption neglects the 
influence of the agent himself on the result. 

It has been shown numerically that the system exhibits a transition from 
an efficient to an inefficient phase , the transition occuring when the num- 
ber of different histories 2 m is of the order of the number of agents N. The 
parameter a = 2 m /N has been introduced to describe universality in the 
model. The transition from an efficient to an inefficient market takes place 
at a critial value a c . In the efficient phase, any information is promptly used 
by all the agents while arbitrage opportunities exist in the inefficient phase. 
That is, the virtual points given to the strategies contain information in the 
inefficient phase, which allows the agents to perform better on average than 
a system with agents guessing at random between the two sides. On the 
contrary, the agents are winning less points on average in the efficient phase. 

Although the model is quite simple, no exact analytical solution has been 
proposed, the main difficulty originating from the dynamical process through 
which agents choose their active strategy. In fact, a stochastic model with 
agents choosing their strategy at random among their set of s strategies has 
also been introduced 0. In this simplified case, all the dynamics of the model 
is included in the geometry of the space of strategies, and analytical results 
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using the Hamming distance between strategies are accessible. The solution 
of a closely related model has been proposed ||. This latter model coincides 
with the minority game in the inefficient phase and also exhibits a transition 
at a c = 0.33740... The aim of this paper is to address the dynamics of the 
choice of a strategy by the agents. 

2 The history distribution 

Consider a particular history h. Suppose that after t time steps, this history 
has happened n times. Among these n occurrences, side has won for no 
time steps and side 1 has won for ri\ time steps, with n$ + ri\ = n. Let us 
define Xh = |no — n\\ as a measure of the asymmetry between the two sides 
for the history h. If no = n\ for a history h, that is, Xh = 0, each side has won 
the same number of times. The virtual points of the strategies do not contain 
any information about this history and the two sides are indistinguishable. 
On the contrary, for Xh ^ 0, one side has won more often than the other and 
the symmetry between the two sides is broken. When Xh ^ 0, the agents try 
to play with the strategies that forecast the side that has won more often 
for h. This behaviour is restricted by the number s of strategies available 
to an agent and by the size of the memory m, because the virtual points 
contain information about all the histories. When m increases, the number 
of histories increases so that the agents are less and less able to retrieve 
information for a particular history. 

We define the history distribution D(x) as the probability that a his- 
tory chosen at random is associated to x. 2 m D(0) is the average number of 
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histories with x = 0. For these histories, the two sides are symmetric. If 
such a history happens, the system behaves like one where the agents choose 
between the two sides at random. 

Fig. 1 (a) and (b) present numerical results for D{ clS 8b function of 
a for agents having s = 2 strategies at disposal. For a < a c , most of the 
distribution D(x) is concentrated around the two values x = and x = 1. 
For these values of a, when a history h happens, agents guess at random on 
alternate time steps. On the other time steps when the history h happens, 
they try to go to the side that won the last time the same history happened 
[|]]. In this sense, the behaviour of the system is antipersistent JJ. Around 
a = a c , the agents are less and less able to efficiently retrieve information 
from the virtual points. For some histories the asymmetry between the two 
sides is large. In fact, for a > a c , we observe numerically that the history 
distribution is no longer stationary. Fig. 1 (b) presents numerical results 
for the average value of D(x) after 10000 time steps. If the simulations are 
continued for longer times, D(x) tends towards a flat distribution, every value 
of x being equally likely to occur. 

Assuming a continuous time evolution as well as a continuous variation 
of D(x) over non integer values of x, the history distribution D(x) for x > 2 
evolves as 

dD(x,t) d „ ld 2 D(x,t) 

_m = _ ((1 _V(»,t))i>(*,t)) + 5 -^i. (i) 

where p + (x) is the probability that if a history associated with x is selected, 
x will increase by one. The history distribution can reach a stationary state 
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only if p + < 0.5. The first term on the right hand side of the previous 
equation describes the local balance for the history distribution while the 
second term on the right hand side is equivalent to a surface tension. It 
smoothes the curve, but can be neglected if the distribution does not reach 
a stationary state. The equation for x = is 

2m ^r = ~ m + (1 " p+ ^ D ^- ^ 

and the equation for x — 1 is 

2m ^r = m ~ m + (1 " p + ^ D ^- ^ 

For p+(l) = 0, D(x > 2) = and D(0) = D(l) = 1/2, as observed for the 
smallest values of a. 

Fig. 2 (a) and (b) present numerical results for p + as a function x for 
various values of a, with 2 strategies per agent. For a <C a c , the system is 
particularly antipersistent, with p + going to as x is increased. For < a < 
a c , p + < 0.5 for the smallest values of x, typically for x ranging from 1 to 
10. When x is increased, p + — > 1, with huge fluctuations as reported on Fig. 
2 (a) for a = 0.3. As can be seen from Fig. 1 (a), just a few histories are 
associated with these high values of x. Hence, for < a < a c , the system is 
mainly antipersistent with some exceptions. For these particular histories, a 
large number of agents are not able to adapt, because both their strategies 
make the same prediction. Consequently, the symmetry between the two 
sides is broken. For a > a c , most histories are persistent, with p + > 0.5. 
In this case, the history distribution is no longer stationary, as mentioned 
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above. 

To test the stationarity of the distribution in the efficient phase, we 
present at Fig. 3 the relative difference between the number of histories 
N~(x) going from x to x — 1 and the number of histories N + (x — 1) going 
from x — 1 to x. b(x) is defined as 

[ > N-(x)+N+(x-l)' W 
Numerical results for b(x) are presented at Fig. 3 for a = 0.3 and s — 2. For 
the small values of x, b(x) ~ 0, showing that the distribution has reached 
a stationary state. On the contrary, for larger values of x, b(x) — ► — 1. 
This shows that for some histories, the symmetry is already broken before 
the transition. The increasing number of these kinds of histories as a is 
increased towards a c will destabilize the history distribution, finally becoming 
non- stationary at a c . Already for a < a c , the history distribution is not 
stationary for the higher values of x. 



3 Before the transition 

In this section, we restrict our attention to the low values of a, that is, to 
the values of a such as is approximately 0. This range of values for 

a will be refered as the low a phase, typically, a < 0.1. In this case, only 
histories with x = or 1 have to be considered. If there are k histories with 
x = 1, the strategies can be classified in decreasing order of virtual points 
[[?]], giving k + 1 different ranks. Strategies of rank have the highest number 



7 



of virtual points, while strategies of rank r have r less points, with r e [0, k}. 
There are 



strategies on rank r. p r (k) = K r (k)/2 2m is the probability to choose a strat- 
egy of rank r when picking a strategy at random in the strategy space. 

Comparing two strategies of rank 0, these strategies have always made 
the same prediction for any history with x — 1. Comparing a strategy of rank 
with a strategy of rank r, these strategies made the same predictions for 
all the histories with x = 1, except for r such histories. A reduced Hamming 
distance d* between two strategies a and a' can be defined as 

d * = zJ2\ a h- a 'h\ ( 6 ) 

K k&V 

where V is defined as the set of histories h with s^O. Then, any strategy at 
rank r is at a reduced distance d*(r) = r/k from any strategy of rank 0. This 
means that if a history with x = 1 happens, d*(r) is the probability that a 
strategy of rank r and a strategy of rank will make opposite predictions. 

As agents always try to use the strategy with the highest number of 
virtual points they have, there should be on average agents in group 
r > 0, with |, 



N 

r(0) / AT — 1 n o-fc 



r-1 



EwW - i-£pi(*)| (7) 

i=0 / V 3=0 



with iVo /A = 1 — (1 — 2 ) s . The superscript (0) refers to the first ap- 
proximation used for the classification of agents. In fact, the distribution 



of strategies according to their virtual points is somehow the result of the 
departures from uniformity present in the initial choice of strategies by the 
agents. Eq. (|7|) neglects this property. 

To go beyond the approximation, consider the average number of 
winners N w for agents playing at random. This is equal to 

N (AT -IV 

N w = — - N 1 . (8) 
2 2 N (^)l 2 

In this case, the probability to win per agent is r w = N w /N. If there are k 
histories with An = 1, there are on average 

agents that possess a strategy of rank r, if we suppose that the agents have 
never adapted. Then, we let the agents adapt if they have among their s — 1 
remaining strategies a strategy of a lower rank. The next approximation for 
the average number of agents on rank r is given by 



3=0 J \ 3=0 



. s-1 

r— 1 



X 



- i -!>*(*) I I- ( 10 ) 

3=0 J V j=0 



Note that A^" (1) = Q + (N - Q )(l - (1 - 2- fe )( s - 1 )). In this approximation, 
we suppose that the distribution of the strategies reflects the property of the 
active strategy of an agent, but does not contain any information about the 
s — 1 passive strategies. 
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With the notation introduced in this section, the variance of the number 
of agents at one side, a 2 , in the low m phase is given by 



k=0 



E 



k\(2 m - fc)! 



D(0) h (l - D(0)) 



2 m -k 



( 




( 



k k 



) 



X 



(Y, N rd* r - N/2) 2 + ]T N r d*{l - d* r ) 



(11) 



r=0 r=0 



In this expression, it is considered that every history is equally likely to occur. 
The frequency of occurence of a history with x = is equal to -D(O). For every 
such history, agents are guessing at random, giving a contribution to a 2 of 
N/4. Histories with x = 1 occurs with a frequency equal to D(l) = 1 — D(Q). 
For every such histories, agents are using the lowest ranked strategy they 
possess. Agents at rank r go to the side that lost more often for the history 
considered with a probability d*, so that on average N r d* agents are going 
to this side. The variance of this number is equal to N r d*(l — d*). 

The result of Eq. ([TT|) as a function of a is presented in Fig. 4 using 
N^ and N} 1 ', for agents having s = 2 strategies at their disposal. We 
have checked that we obtain similar results for a given value of a, using 
several values for iV and m. For a fixed value of N, Eq. ([II]) is a very 
slowly increasing function of m. The variance a 2 of the number of agents at 
one side obtained by numerical simulations are also reported for comparison, 
using m — 4. Numerically, a 2 /N is also slightly increasing with m for a fixed 
value of N. As can be seen, Eq. ( |T0D gives a better agreement with the 
numerical simulations, up to a = 0.1. 
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4 Conclusions 



In this paper, we introduced the history distribution as a complementary tool 
to the Hamming distance to investigate the dynamics of the minority game. 
We argue that the history distribution is useful to investigate the process of 
the choice of the active strategy of an agent. The properties of the history 
distribution are numerically investigated, showing that when the transition 
from an efficient to an inefficient market occurs, the distribution becomes 
non-stationary. 

An approximate analytical expression for the variance of the number of 
agents at one side is proposed for the efficient phase. It is only exact for very 
low values of a, typically, a < 0.1. We were able to show that the agents 
are not just picking their strategies at random in a ranking of the strategies 
according to their virtual points. In fact, the ranking is the result itself of the 
initial choice of the strategies by the agents. Hence, the information about 
the strategies possessed by the agents has to be retrieved from the ranking. 

This work can be seen as complementary to the work of Challet and 
Marsili ||. It supplies some analytical considerations in the efficient phase, 
where an analytical solution is still lacking. 
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Figure Captions 



Figure 1 — Numerical simulations for the history distribution D(x) as a 
function of a for agents with s = 2 strategies at disposal, (a) a c > a — 0.1, 
0.2 and 0.3; (b) a c < a = 0.4, 0.5 and 0.6. 

Figure 2 — Numerical simulations for the parameter p + as function of x 
for agents with s = 2 strategies at disposal, (a) p + for a = 0.3; (b) p + for 
small values of x and a = 0.1, 0.2, 0.3, 0.4, 0.5 and 0.6. Note the progressive 
disappearance of the dip in the curve as a is increased. 

Figure 3 — Numerical simulations for the stability of the history distribution 
for a = 0.3, s = 2. b is proposed as a function of x. The curve is flat for 
small values of x, but as can be seen in the inset, 6^—1 as a; is increased. 

Figure 4 — Variance a 2 of the number of agents at one side divided by the 
number of agents N, as a function of a for s = 2. Numerical results (•) 
from simulations of the system are compared to the analytical results. The 
analytical results for a 2 /N using the distributions (o) and (A) are 
presented. 
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